Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 9800 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 918.9 KiB |
| Average record size in memory | 96.0 B |
Variable types
| NUM | 10 |
|---|---|
| BOOL | 2 |
Reproduction
| Analysis started | 2022-01-12 16:56:41.832047 |
|---|---|
| Analysis finished | 2022-01-12 16:56:53.609255 |
| Duration | 11.78 seconds |
| Version | pandas-profiling v2.7.1 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
user is highly correlated with df_index | High correlation |
df_index is highly correlated with user | High correlation |
Total_time_spent is highly skewed (γ1 = 42.19411122) | Skewed |
df_index is uniformly distributed | Uniform |
user is uniformly distributed | Uniform |
df_index has unique values | Unique |
user has unique values | Unique |
Live_sessions has 1833 (18.7%) zeros | Zeros |
Replay_sessions has 2980 (30.4%) zeros | Zeros |
Competition has 4464 (45.6%) zeros | Zeros |
Breakouts has 6815 (69.5%) zeros | Zeros |
Avg_ranking has 6876 (70.2%) zeros | Zeros |
| Distinct count | 9800 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4994.696224489796 |
|---|---|
| Minimum | 0 |
| Maximum | 9999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 496.95 |
| Q1 | 2497.75 |
| median | 4998.5 |
| Q3 | 7491.25 |
| 95-th percentile | 9497.05 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 4993.5 |
Descriptive statistics
| Standard deviation | 2886.456393 |
|---|---|
| Coefficient of variation (CV) | 0.5779042935 |
| Kurtosis | -1.19990403 |
| Mean | 4994.696224 |
| Median Absolute Deviation (MAD) | 2496.5 |
| Skewness | 0.000169299175 |
| Sum | 48948023 |
| Variance | 8331630.509 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 3371 | 1 | < 0.1% | |
| 7481 | 1 | < 0.1% | |
| 5432 | 1 | < 0.1% | |
| 9526 | 1 | < 0.1% | |
| 1330 | 1 | < 0.1% | |
| 7473 | 1 | < 0.1% | |
| 5424 | 1 | < 0.1% | |
| 9518 | 1 | < 0.1% | |
| 1322 | 1 | < 0.1% | |
| Other values (9790) | 9790 | 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9999 | 1 | < 0.1% | |
| 9998 | 1 | < 0.1% | |
| 9997 | 1 | < 0.1% | |
| 9996 | 1 | < 0.1% | |
| 9995 | 1 | < 0.1% |
| Distinct count | 9800 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4995.696224489796 |
|---|---|
| Minimum | 1 |
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 497.95 |
| Q1 | 2498.75 |
| median | 4999.5 |
| Q3 | 7492.25 |
| 95-th percentile | 9498.05 |
| Maximum | 10000 |
| Range | 9999 |
| Interquartile range (IQR) | 4993.5 |
Descriptive statistics
| Standard deviation | 2886.456393 |
|---|---|
| Coefficient of variation (CV) | 0.5777886131 |
| Kurtosis | -1.19990403 |
| Mean | 4995.696224 |
| Median Absolute Deviation (MAD) | 2496.5 |
| Skewness | 0.000169299175 |
| Sum | 48957823 |
| Variance | 8331630.509 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 7457 | 1 | < 0.1% | |
| 9518 | 1 | < 0.1% | |
| 3371 | 1 | < 0.1% | |
| 7465 | 1 | < 0.1% | |
| 5416 | 1 | < 0.1% | |
| 9510 | 1 | < 0.1% | |
| 3363 | 1 | < 0.1% | |
| 1314 | 1 | < 0.1% | |
| 5408 | 1 | < 0.1% | |
| Other values (9790) | 9790 | 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10000 | 1 | < 0.1% | |
| 9999 | 1 | < 0.1% | |
| 9998 | 1 | < 0.1% | |
| 9997 | 1 | < 0.1% | |
| 9996 | 1 | < 0.1% |
Grade
Real number (ℝ≥0)
| Distinct count | 13 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.986938775510204 |
|---|---|
| Minimum | 1 |
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 11 |
| Q3 | 12 |
| 95-th percentile | 12 |
| Maximum | 13 |
| Range | 12 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.274290662 |
|---|---|
| Coefficient of variation (CV) | 0.2277265049 |
| Kurtosis | 1.558274523 |
| Mean | 9.986938776 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -1.302398315 |
| Sum | 97872 |
| Variance | 5.172398016 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 12 | 3236 | 33.0% | |
| 11 | 2044 | 20.9% | |
| 10 | 1254 | 12.8% | |
| 9 | 854 | 8.7% | |
| 8 | 820 | 8.4% | |
| 7 | 784 | 8.0% | |
| 6 | 263 | 2.7% | |
| 5 | 178 | 1.8% | |
| 4 | 140 | 1.4% | |
| 13 | 74 | 0.8% | |
| Other values (3) | 153 | 1.6% |
| Value | Count | Frequency (%) | |
| 1 | 71 | 0.7% | |
| 2 | 25 | 0.3% | |
| 3 | 57 | 0.6% | |
| 4 | 140 | 1.4% | |
| 5 | 178 | 1.8% |
| Value | Count | Frequency (%) | |
| 13 | 74 | 0.8% | |
| 12 | 3236 | 33.0% | |
| 11 | 2044 | 20.9% | |
| 10 | 1254 | 12.8% | |
| 9 | 854 | 8.7% |
Active_subjects
Real number (ℝ≥0)
| Distinct count | 18 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.206734693877551 |
|---|---|
| Minimum | 1 |
| Maximum | 18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 18 |
| Range | 17 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.262522399 |
|---|---|
| Coefficient of variation (CV) | 0.7055533477 |
| Kurtosis | 2.236484534 |
| Mean | 3.206734694 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.360047175 |
| Sum | 31426 |
| Variance | 5.119007604 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1 | 2595 | 26.5% | |
| 2 | 2196 | 22.4% | |
| 3 | 1503 | 15.3% | |
| 4 | 1258 | 12.8% | |
| 5 | 715 | 7.3% | |
| 6 | 577 | 5.9% | |
| 7 | 433 | 4.4% | |
| 8 | 233 | 2.4% | |
| 9 | 134 | 1.4% | |
| 10 | 65 | 0.7% | |
| Other values (8) | 91 | 0.9% |
| Value | Count | Frequency (%) | |
| 1 | 2595 | 26.5% | |
| 2 | 2196 | 22.4% | |
| 3 | 1503 | 15.3% | |
| 4 | 1258 | 12.8% | |
| 5 | 715 | 7.3% |
| Value | Count | Frequency (%) | |
| 18 | 1 | < 0.1% | |
| 17 | 2 | < 0.1% | |
| 16 | 3 | < 0.1% | |
| 15 | 3 | < 0.1% | |
| 14 | 6 | 0.1% |
| Distinct count | 6138 |
|---|---|
| Unique (%) | 62.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 429.9221751780001 |
|---|---|
| Minimum | 0.016666667 |
| Maximum | 224290.7167 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 0.016666667 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 5.05 |
| median | 38.83333333 |
| Q3 | 226.6583333 |
| 95-th percentile | 1535.171666 |
| Maximum | 224290.7167 |
| Range | 224290.7 |
| Interquartile range (IQR) | 221.6083333 |
Descriptive statistics
| Standard deviation | 3335.934656 |
|---|---|
| Coefficient of variation (CV) | 7.75939193 |
| Kurtosis | 2395.869405 |
| Mean | 429.9221752 |
| Median Absolute Deviation (MAD) | 37.76666667 |
| Skewness | 42.19411122 |
| Sum | 4213237.317 |
| Variance | 11128460.03 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.25 | 31 | 0.3% | |
| 0.316666667 | 29 | 0.3% | |
| 0.35 | 25 | 0.3% | |
| 0.816666667 | 25 | 0.3% | |
| 0.233333333 | 25 | 0.3% | |
| 0.3 | 24 | 0.2% | |
| 0.4 | 22 | 0.2% | |
| 0.216666667 | 22 | 0.2% | |
| 1.033333333 | 21 | 0.2% | |
| 0.566666667 | 20 | 0.2% | |
| Other values (6128) | 9556 | 97.5% |
| Value | Count | Frequency (%) | |
| 0.016666667 | 10 | 0.1% | |
| 0.033333333 | 9 | 0.1% | |
| 0.05 | 10 | 0.1% | |
| 0.066666667 | 6 | 0.1% | |
| 0.083333333 | 11 | 0.1% |
| Value | Count | Frequency (%) | |
| 224290.7167 | 1 | < 0.1% | |
| 123917.0667 | 1 | < 0.1% | |
| 90597.83333 | 1 | < 0.1% | |
| 89434.56667 | 1 | < 0.1% | |
| 65437.33333 | 1 | < 0.1% |
| Distinct count | 209 |
|---|---|
| Unique (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.96561224489796 |
|---|---|
| Minimum | 0 |
| Maximum | 997 |
| Zeros | 1833 |
| Zeros (%) | 18.7% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 8 |
| 95-th percentile | 40 |
| Maximum | 997 |
| Range | 997 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 40.11378781 |
|---|---|
| Coefficient of variation (CV) | 3.658143924 |
| Kurtosis | 225.2753668 |
| Mean | 10.96561224 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 12.8571329 |
| Sum | 107463 |
| Variance | 1609.115972 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1 | 1974 | 20.1% | |
| 0 | 1833 | 18.7% | |
| 2 | 1111 | 11.3% | |
| 3 | 760 | 7.8% | |
| 4 | 552 | 5.6% | |
| 5 | 427 | 4.4% | |
| 6 | 290 | 3.0% | |
| 7 | 254 | 2.6% | |
| 8 | 229 | 2.3% | |
| 9 | 187 | 1.9% | |
| Other values (199) | 2183 | 22.3% |
| Value | Count | Frequency (%) | |
| 0 | 1833 | 18.7% | |
| 1 | 1974 | 20.1% | |
| 2 | 1111 | 11.3% | |
| 3 | 760 | 7.8% | |
| 4 | 552 | 5.6% |
| Value | Count | Frequency (%) | |
| 997 | 2 | < 0.1% | |
| 958 | 1 | < 0.1% | |
| 941 | 1 | < 0.1% | |
| 733 | 1 | < 0.1% | |
| 732 | 1 | < 0.1% |
| Distinct count | 81 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3148979591836736 |
|---|---|
| Minimum | 0 |
| Maximum | 380 |
| Zeros | 2980 |
| Zeros (%) | 30.4% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 12 |
| Maximum | 380 |
| Range | 380 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 8.350656569 |
|---|---|
| Coefficient of variation (CV) | 2.519129298 |
| Kurtosis | 495.6122341 |
| Mean | 3.314897959 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 15.35871359 |
| Sum | 32486 |
| Variance | 69.73346514 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 2980 | 30.4% | |
| 1 | 2331 | 23.8% | |
| 2 | 1331 | 13.6% | |
| 3 | 819 | 8.4% | |
| 4 | 526 | 5.4% | |
| 5 | 379 | 3.9% | |
| 6 | 269 | 2.7% | |
| 7 | 184 | 1.9% | |
| 8 | 160 | 1.6% | |
| 9 | 110 | 1.1% | |
| Other values (71) | 711 | 7.3% |
| Value | Count | Frequency (%) | |
| 0 | 2980 | 30.4% | |
| 1 | 2331 | 23.8% | |
| 2 | 1331 | 13.6% | |
| 3 | 819 | 8.4% | |
| 4 | 526 | 5.4% |
| Value | Count | Frequency (%) | |
| 380 | 1 | < 0.1% | |
| 164 | 1 | < 0.1% | |
| 163 | 1 | < 0.1% | |
| 144 | 1 | < 0.1% | |
| 141 | 1 | < 0.1% |
| Distinct count | 131 |
|---|---|
| Unique (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.25265306122449 |
|---|---|
| Minimum | 0 |
| Maximum | 491 |
| Zeros | 4464 |
| Zeros (%) | 45.6% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 16 |
| Maximum | 491 |
| Range | 491 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 17.7444596 |
|---|---|
| Coefficient of variation (CV) | 4.172562245 |
| Kurtosis | 298.0944892 |
| Mean | 4.252653061 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 14.67571715 |
| Sum | 41676 |
| Variance | 314.8658466 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 4464 | 45.6% | |
| 1 | 1838 | 18.8% | |
| 2 | 865 | 8.8% | |
| 3 | 522 | 5.3% | |
| 4 | 347 | 3.5% | |
| 5 | 270 | 2.8% | |
| 6 | 193 | 2.0% | |
| 7 | 165 | 1.7% | |
| 8 | 150 | 1.5% | |
| 9 | 110 | 1.1% | |
| Other values (121) | 876 | 8.9% |
| Value | Count | Frequency (%) | |
| 0 | 4464 | 45.6% | |
| 1 | 1838 | 18.8% | |
| 2 | 865 | 8.8% | |
| 3 | 522 | 5.3% | |
| 4 | 347 | 3.5% |
| Value | Count | Frequency (%) | |
| 491 | 1 | < 0.1% | |
| 489 | 1 | < 0.1% | |
| 446 | 1 | < 0.1% | |
| 418 | 1 | < 0.1% | |
| 405 | 1 | < 0.1% |
| Distinct count | 106 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.614183673469388 |
|---|---|
| Minimum | 0 |
| Maximum | 516 |
| Zeros | 6815 |
| Zeros (%) | 69.5% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 11 |
| Maximum | 516 |
| Range | 516 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 13.68798665 |
|---|---|
| Coefficient of variation (CV) | 5.236046261 |
| Kurtosis | 514.4417667 |
| Mean | 2.614183673 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.53287383 |
| Sum | 25619 |
| Variance | 187.3609785 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 6815 | 69.5% | |
| 1 | 908 | 9.3% | |
| 2 | 468 | 4.8% | |
| 3 | 300 | 3.1% | |
| 4 | 223 | 2.3% | |
| 5 | 153 | 1.6% | |
| 6 | 124 | 1.3% | |
| 8 | 92 | 0.9% | |
| 7 | 87 | 0.9% | |
| 10 | 61 | 0.6% | |
| Other values (96) | 569 | 5.8% |
| Value | Count | Frequency (%) | |
| 0 | 6815 | 69.5% | |
| 1 | 908 | 9.3% | |
| 2 | 468 | 4.8% | |
| 3 | 300 | 3.1% | |
| 4 | 223 | 2.3% |
| Value | Count | Frequency (%) | |
| 516 | 1 | < 0.1% | |
| 499 | 1 | < 0.1% | |
| 357 | 1 | < 0.1% | |
| 301 | 1 | < 0.1% | |
| 266 | 1 | < 0.1% |
| Distinct count | 1483 |
|---|---|
| Unique (%) | 15.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4080278227341834 |
|---|---|
| Minimum | 0.0 |
| Maximum | 43.0 |
| Zeros | 6876 |
| Zeros (%) | 70.2% |
| Memory size | 76.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 7.25 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 3.206991416 |
|---|---|
| Coefficient of variation (CV) | 2.27764776 |
| Kurtosis | 23.88659153 |
| Mean | 1.408027823 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.971779487 |
| Sum | 13798.67266 |
| Variance | 10.28479394 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 6876 | 70.2% | |
| 1 | 578 | 5.9% | |
| 4 | 52 | 0.5% | |
| 5 | 39 | 0.4% | |
| 2 | 34 | 0.3% | |
| 3 | 28 | 0.3% | |
| 9 | 28 | 0.3% | |
| 7 | 26 | 0.3% | |
| 8 | 26 | 0.3% | |
| 6 | 25 | 0.3% | |
| Other values (1473) | 2088 | 21.3% |
| Value | Count | Frequency (%) | |
| 0 | 6876 | 70.2% | |
| 1 | 578 | 5.9% | |
| 1.04 | 1 | < 0.1% | |
| 1.052631579 | 1 | < 0.1% | |
| 1.0625 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 43 | 1 | < 0.1% | |
| 40.5 | 1 | < 0.1% | |
| 37 | 1 | < 0.1% | |
| 34.30769231 | 1 | < 0.1% | |
| 33.35294118 | 1 | < 0.1% |
Is_Converted
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 76.7 KiB |
| 0 | |
|---|---|
| 1 | 303 |
| Value | Count | Frequency (%) | |
| 0 | 9497 | 96.9% | |
| 1 | 303 | 3.1% |
Is_Activated
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 76.7 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 5277 | 53.8% | |
| 1 | 4523 | 46.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | user | Grade | Active_subjects | Total_time_spent | Live_sessions | Replay_sessions | Competition | Breakouts | Avg_ranking | Is_Converted | Is_Activated | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1 | 6 | 1 | 83.300000 | 2 | 1 | 0 | 2 | 3.857143 | 0 | 1 |
| 1 | 1 | 2 | 12 | 1 | 13.800000 | 0 | 1 | 0 | 0 | 0.000000 | 0 | 0 |
| 2 | 2 | 3 | 12 | 1 | 55.183333 | 0 | 2 | 0 | 0 | 0.000000 | 0 | 0 |
| 3 | 3 | 4 | 12 | 1 | 7.216667 | 0 | 0 | 2 | 0 | 0.000000 | 0 | 0 |
| 4 | 4 | 5 | 9 | 2 | 0.250000 | 0 | 1 | 0 | 0 | 0.000000 | 0 | 0 |
| 5 | 5 | 6 | 10 | 5 | 6.566667 | 1 | 1 | 1 | 0 | 0.000000 | 0 | 0 |
| 6 | 6 | 7 | 12 | 1 | 3063.483333 | 19 | 35 | 1 | 13 | 5.064516 | 1 | 1 |
| 7 | 7 | 8 | 11 | 2 | 64.700000 | 0 | 1 | 0 | 0 | 0.000000 | 0 | 1 |
| 8 | 8 | 9 | 12 | 1 | 1.900000 | 0 | 1 | 0 | 0 | 0.000000 | 0 | 0 |
| 9 | 9 | 10 | 11 | 2 | 3.283333 | 2 | 0 | 0 | 0 | 0.000000 | 0 | 0 |
Last rows
| df_index | user | Grade | Active_subjects | Total_time_spent | Live_sessions | Replay_sessions | Competition | Breakouts | Avg_ranking | Is_Converted | Is_Activated | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9790 | 9990 | 9991 | 11 | 1 | 51.366667 | 1 | 1 | 0 | 1 | 1.000000 | 0 | 0 |
| 9791 | 9991 | 9992 | 12 | 3 | 314.483333 | 5 | 1 | 13 | 4 | 5.227273 | 0 | 1 |
| 9792 | 9992 | 9993 | 11 | 1 | 0.133333 | 0 | 1 | 0 | 0 | 0.000000 | 0 | 0 |
| 9793 | 9993 | 9994 | 12 | 3 | 3.916667 | 0 | 3 | 0 | 0 | 0.000000 | 0 | 0 |
| 9794 | 9994 | 9995 | 12 | 3 | 1859.600000 | 20 | 12 | 14 | 6 | 2.344828 | 0 | 1 |
| 9795 | 9995 | 9996 | 12 | 2 | 106.200000 | 0 | 3 | 0 | 0 | 0.000000 | 0 | 1 |
| 9796 | 9996 | 9997 | 10 | 4 | 68.316667 | 3 | 0 | 1 | 3 | 1.000000 | 0 | 1 |
| 9797 | 9997 | 9998 | 12 | 2 | 175.466667 | 0 | 6 | 0 | 0 | 0.000000 | 0 | 1 |
| 9798 | 9998 | 9999 | 11 | 1 | 1.400000 | 0 | 2 | 0 | 0 | 0.000000 | 0 | 0 |
| 9799 | 9999 | 10000 | 11 | 3 | 29.550000 | 0 | 2 | 1 | 0 | 0.000000 | 0 | 0 |